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Abstract. In this paper, we extend Meek's conjecture ([Meekl . 119971 ) from directed and 
acyclic graphs to chain graphs, and prove that the extended conjecture is true. Specifically, 
we prove that if a chain graph H is an independence map of the independence model induced 
by another chain graph G, then (i) G can be transformed into H by a sequence of directed 
and undirected edge additions and feasible splits and mergings, and (ii) after each operation 
in the sequence H remains an independence map of the independence model induced by G. 
Our result has the same impor tant consequence for learning chain graphs from data as the 
proof of Meek's conjecture in ( Chickerina 12002 ) had for learning Bayesian networks from 



data: It makes it possible to develop efficient and asymptotically correct learning algorithms 
under mild assumptions. 



1. Preliminaries 

In this section, we review some concep ts from probabilis tic gra phical m o dels t hat are used 
later in this paper. See, for instance, (ILauritzenl . Il996h and ( jStudenyl . 120051 ) for further 
information. All the graphs and independence models in this paper are defined over a finite 
set V. All the graphs in this paper are hybrid graphs, i.e. they have (possibly) both directed 
and undirected edges. We assume throughout the paper that the union and the intersection 
of sets precede the set difference when evaluating an expression. 

If a graph G has a directed (resp. undirected) edge between two nodes X 1 and X 2 , then 
we write that Xi —> X 2 (resp. X\ — X 2 ) is in G. When there is a directed or undirected edge 
between two nodes of G, we say that the two nodes are adjacent in G. The parents of a set 
of nodes Y of G is the set PaciY) = {Xi\X\ — > X 2 is in G and X 2 G Y}. The neighbors of a 
set of nodes Y of G is the set Nec{Y) = {X\\X\ — X 2 is in G and X 2 G Y}. The boundary 
of a node X 2 of G is the set Bdc(X 2 ) = Po>g(X 2 ) U Neo{X 2 ). A route between two nodes 
Xi and X n of 6? is a sequence of nodes Xi, . . . , X n st Xi and X i+ i are adjacent in G for all 
1 < i < n. The length of a route is the number of (not necessarily distinct) edges in the 
route. We treat all singletons as routes of length zero. A route in G is called undirected if 
Xi — X i+ i is in G for all 1 < % < n. A route in G is called descending from Xi to X n if 
Xi — X i+ i or Xi — > X i+ i is in G for all 1 < % < n. If there is a descending route from Xi to 
X n in G, then X n is called a descendant of Xi. Note that Xi is a descendant of itself, since 
we allow routes of length zero. The descendants of a set of nodes Y of G is the union of the 
descendants of each node of Y in G. Given a route p between Xi and X n in G and a route p' 
between X n and X m in G, p U p' denotes the route between Xi and X m in G resulting from 
appending p' to p. 

A chain is a partition of V into ordered subsets, which we call the blocks of the chain. We 
say that an element X G V is to the left of another element Y G V in a chain a if the block 
of a containing X precedes the block of a containing Y in a. Equivalently, we can say that 
Y is to the right of X in a. We say that a graph G and a chain a are consistent when (i) 
for every edge X — > Y in G, X is to the left of Y in a, and (ii) for every edge X — Y in G, 
X and Y are in the same block of a. A chain graph (CG) is a graph that is consistent with 
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a chain. A set of nodes of a CG is connected if there exists an undirected route in the CG 
between every pair of nodes of the set. A component of a CG is a maximal (wrt set inclusion) 
connected set of its nodes. A block of a CG is a set of components of the CG st there is no 
directed edge between their nodes in the CG. Note that a component of a CG is connected, 
whereas a block of a CG or a block of a chain that is consistent with a CG is not necessarily 
connected. Given a set K of components of G, a component G G K is called maximal in G 
if none of its nodes is a descendant of K \ {C} in G. A component C of G is called terminal 
in G if its descendants in G are exactly C. Let a component C of G be partitioned into two 
non-empty connected subsets C\L and L. By splitting C into C\L and L in G, we mean 
replacing every edge A — FinGstAGG\L and F G L with an edge X — > Y. Moreover, 
we say that the split is feasible if (i) X — Y is in G for all X, Y G Nec(L) D (C\ L), and (ii) 
A -»■ F is in G for all X G Pa G (L) and F G Ae G (L) n (C \ L). Let L and R denote two 
components of G st Pac(R) fl L 7^ 0. By merging L and i? in G, we mean replacing every 
edge X — )• F in G st A G L and F G i? with an edge X — Y. Moreover, we say that the 
merging is feasible if (i) X — Y is in G for all X, Y G Pac(R) D L, and (ii) A — >■ F is in G 
for all A G Pa G (#) \ L and F G Pa G (#) H L. 

A section of a route p in a CG is a maximal undirected subroute of p. A section A2 — 
. . . — A n _i of p is a collider section of p if Ai — > X2 — ■ ■ ■ — A n _i <r- A„ is a subroute of p. 
Moreover, the edges X\ —> A 2 and A n _i A n are called collider edges. Let A, F and Z 
denote three disjoint subsets of V. A route p in a CG is said to be Z-active when (i) every 
collider section of p has a node in Z, and (ii) every non-collider section of p has no node in 
Z. When there is no route in a CG G between a node of A and a node of F that is Z-active, 
we say that A is separated from F given Z in G and denote it as A_L G F|Z. We denote by 
Xl G Y\Z that Ai_ G F|Z does not hold. 

Let A, F, Z and IF denote four disjoint subsets of V. An independence model M is a 
set of statements of the form A _L mY\Z, meaning that A is independent of F given Z. 
Given two independence models M and N, we denote by M C N that if A _L mY\Z then 
A _L a?F|Z. We say that M is a graphoid if it satisfies the following properties: Symmetry 
A _L M Y\Z F _L M X\Z, decomposition A _L M Y U W\Z X i_ M Y\Z, weak union 
Ai_ A/ FlW|Z => X± M Y\ZUW, contraction X ± M Y\ZUWAX ± M W\Z => A_L M YUW\Z, 
and intersection A _L M Y\Z UW AX ± M W\Z U F =^ A _L M F U W\Z. The independence 
model induced by a CG G, denoted as /(G), is the set of sepa r ation statements A_L G F|Z. 
It is known that I{G) is a graphoid (IStudenv and Bouckaertl . Il998l . Lemma 3.1). Let H 



denote the gr aph resulting from a f easible split or merging in a CG G. Then, if is a CG and 
I{H) = 1(G) flStudenv et all 120091 . Lemma 5 and Corollary 9). 



A CG G is an independence (I) map of an independence model M if I{G) C M. Moreover, 
G is a minimal independence (MI) map of M if removing any edge from G makes it cease 
to be an I map of M. Given any chain C\, . . . , C n that is consistent with G, we say that G 
satisfies the pairwise block-recursive Markov property wrt M if A_L a/F| Uy =1 Cj \ {A, F} for 
all non-adjacent nodes A and F of G and where k* is the smallest k st A, F G U^ =1 Gj. If M 
is a graphoid an d G satisfies the pairwise block-recursive Markov property wrt M, then G is 



an I map of M (iLauritzenl . Il996l . Theorem 3.34). We say that a CG G a is a MI map of an 
independence model M relative to a chain a if G a is a MI map of M and G a is consistent 
with a. 

2. Extension of Meek's Conjecture to Chain Graphs 

Given two directed and acyclic graphs G and H st 1(H) C /(G), Meek's conjecture states 
that we can transform G into H by a sequence of arc additions and covered arc reversals 
st afte r each operation in the sequence G is a directed and a cyclic graph and 1(H) C /(G) 
flMeeki . Il997h . Meek's conjecture was proven to be true in (jChickeringl . |2002| . Theorem 4) 



by developing an algorithm that constructs a valid sequence of operations. In this section, 
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Fbsplit(A, L, G) 

/* Given a block A of a CG G and a subset A of A, the algorithm repeatedly splits a 
component of G until A becomes a block of G. Before the splits, the algorithm adds to G 
the smallest set of edges so that the splits are feasible */ 

1 Let L%, . . . ,L n denote the maximal connected subsets of A in G 

2 For i = 1 to n do 

3 Add an edge X - Y to G for all X, Y G Ne G (U) n (K \ L) 

4 Add an edge X -> Y to G for all A G Pa G (U) and Y G Nee (Li) n (A \ A) 

5 For i = 1 to n do 

6 Let Aj denote the component of G st Aj C Aj 

7 If Aj \ A / then 

8 Split Kj into Aj \ Lj and Lj in G 

Fbmerge(L, A, G) 

/* Given two blocks L and A of a CG G, the algorithm repeatedly merges two components 
of G until L U R becomes a block of G. Before the mergings, the algorithm adds to G the 
smallest set of edges so that the mergings are feasible */ 

1 Let Ri, . . . , R n denote the components of G that are in R 

2 For i = 1 to n do 

3 Add an edge X - Y to G for all A, Y G Pa G (Ri) n L 

4 Add an edge X -> Y to G for all A G Pa G (Ri) \ L and Y G Pa G {Ri) n L 

5 For i = 1 to n do 

6 Let Lj denote the component of G st Lj C L U A and Pa G (Ri) n A,- ^ 

7 If A, ^ then 

8 Merge Lj and A^ in G 



Figure 1. Fbsplit and fbmerge. 

we extend Meek's conjecture from directed and acyclic graphs to CGs, and prove that the 
extended conjecture is true. Specifically, given two CGs G and H st 1(H) C 1(G), we 
prove that G can be transformed into if by a sequence of directed and undirected edge 
additions and feasible splits and mergings st after each operation in the sequence G is a CG 
and 1(H) C /(G). The proof is constructive in the sense that we give an algorithm that 
constructs a valid sequence of operations. 

We start by introducing two new operations on CGs. It is worth mentioning that all the 
algorithms in this paper use a "by reference" calling convention, meaning that the algorithms 
can modify the arguments passed to them. Let K denote a block of a CG G. Let LCif. By 
feasible block splitting (fbsplitting) K into K\L and L in G, we mean running the algorithm 
at the top of Figure [TJ The algorithm repeatedly splits a component of G until L becomes a 
block of G. Before the splits, the algorithm adds to G the smallest set of edges so that the 
splits are feasible. Let L and R denote two blocks of a CG G. By feasible block merging 
(fbmerging) L and R in G, we mean running the algorithm at the bottom of Figure [TJ The 
algorithm repeatedly merges two components of G until L U R becomes a block of G. Before 
the mergings, the algorithm adds to G the smallest set of edges so that the mergings are 
feasible. It is worth mentioning that the component Lj in line 6 is guaranteed to be unique 
by the edges added in lines 3 and 4. 
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Construct ft(G, a, ft) 

/* Given a CG G and a chain a, the algorithm derives a chain /3 that is consistent with G 
and as close to a as possible */ 

1 Set ft = 

2 Set H = G 

3 Let C denote any terminal component of H whose leftmost node in a is rightmost in a 

4 Add C as the leftmost block of ft 

5 Let R denote the right neighbor of C in ft 

6 If R ^ 0, Pclg(R) n C = 0, and the nodes of C are to the right of the nodes of R in a then 

7 Replace C, R with R,C hx ft 

8 Go to line 5 

9 Remove C and all its incoming edges from H 

10 If H ^ then 

11 Go to line 3 

Method B3(G, a) 

/* Given a CG G and a chain a, the algorithm transforms G into G Q */ 

1 Construct j3{G, a, ft) 

2 Let C denote the rightmost block of a that has not been considered before 

3 Let K denote the leftmost block of ft st K n C ^ 

4 Set L = if n C 

5 If iT \ L then 

6 Fbsplit(if, L, G) 

7 Replace K with K \L,L in /3 

8 Let -R denote the right neighbor of L in ft 

9 If i? ^ and some node of R is not to the right of the nodes of L in a 

10 Fbmerge(L, R, G) 

11 Replace L, R with L U i? in ft 

12 Go to line 3 

13 If ft ^ a then 

14 Go to line 2 



Figure 2. Method B3. 

Our proof of the extension of Meek's conjecture to CGs builds upon an algorithm for 
efficiently deriving the MI map G a of the independence model induced by a given CG G 
relative to a given chain a. The pseudocode of the algorithm, called Method B3, can be 
seen in Figure [2J Method B3 works iteratively by fbsplitting and fbmerging some blocks of 
G until the resulting CG is consistent with a. It is not difficult to see that such a way of 
working results in a CG that is an I map of 1(G). However, in order to arrive at G a , the 
blocks of G to modify in each iteration must be carefully chosen. For this purpose, Method 
B3 starts by calling Construct ft to derive a chain that is consistent with G and as close to 
a as possible (see lines 5-8). By ft being as close to a as possible, we mean that the number 
of blocks Method B3 will later fbsplit and fbmerge is kept at a minimum, because Method 
B3 will use ft to choose the blocks to modify in each iteration. A line of Construct ft that is 
worth explaining is line 3, because it is crucial for the correctness of Method B3 (see Case 
3.2.4 in the proof of Lemma H]). This line determines the order in which the components of 
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H (initially H = G) are added to (initially j3 = 0). In principle, a component of H may 
have nodes from several blocks of a. Line 3 labels each terminal component of H with its 
leftmost node in a and, then, chooses any terminal component whose label node is rightmost 
in a. This is the next component to add to 0. 

Once /3 has been constructed, Method B3 proceeds to transform G into G a . In particular, 
Method B3 considers the blocks of a one by one in the reverse order in which they appear 
in a. For each block C of a, Method B3 iterates through the following steps. First, it finds 
the leftmost block K of (3 that has some nodes from C. These nodes, denoted as L, are then 
moved to the right in /3 by fbsplitting K to create a new block L of G and ft. If the nodes of 
the right neighbor R of L in /3 are to the right of the nodes of L in a, then Method B3 is done 
with C. Otherwise, Method B3 moves L further to the right in /3 by fbmerging L and R in 
G and /3. We prove below that Method B3 is correct. We prove first some auxiliary results. 

Lemma 1. Let M denote an independence model, and a a chain Ci, . . . ,C n . If M is a 
graphoid, then there exits a unique CG G a that is a MI map of M relative to a. Specifically, 
for each node X of each block Ck of a, Bd Ga (X) is the smallest subset B ofU^ =1 Cj \ {X} st 
X± M \J* =1 C j \{X}\B\BR 

Proof. Let X and Y denote any two non-adjacent nodes of G a . Let k* denote the smallest 
k st X, Y G Uj =1 Cj. Assume without loss of generality that X G Ck*. Then, X JL m U*1i 
Cj \ {X} \ Bd Ga (X)\Bd Ga (X) by construction of G a and, thus, X± M Y\ Uf =l Cj \ {X,Y} 
by weak union. Then, G a satisfies the pairwise block-recursive Markov property wrt M and, 
thus, G a is an I map of M. In fact, G a is a MI map of M by construction of Bd Ga (X). 

Assume to the contrary that there exists another CG H a that is a MI map of M relative 
to a. Let X denote any node st Bd Ga (X) ^ Bd Ha (X). Let X G C k . Then, X± M U*? =1 Cj \ 
{X}\Bd Ga (X)\Bd Ga (X) X L M a vj k j=l C 3 \{X}\Bd Ha {X)\Bd Ha {X) because G a and H a 
are MI maps of M. Then, X ± M Uj =1 Cj \{X}\ Bd Ga (X) n Bd Ha (X)\Bd Ga (X) n Bd Ha (X) 
by intersection. However, this contradicts the construction of Bd Ga (X), because Bd Ga (X) fl 
Bdn a {X) is smaller than Bd Ga (X). 

□ 

Lemma 2. Let G and H denote two CGs st 1(H) C 1(G). For any component C of G, there 
exists a unique component of H that is maximal in H from the set of components of H that 
contain a descendant of C in G. 

Proof. By definition of CG, there exists at least one such component of H . Assume to the 
contrary that there exist two such components of H, say K and K' . Note that Pan(K)^K' = 
and Pan(K') H K = by definition of K and K' . Note also that no node of K or Pan(K) 
is a descendant of K' in H by definition of K. This implies that K' _L hK U Pan(K) \ 
PaH(K')\PaH(K') and, thus, K A-hK'\Pclh(K) U Pan(K') by weak union and symmetry. 

That K and K' contain some descendants k and k' of C in G implies that there are 
descending routes from C to k and k' in G st the nodes in the routes are descendant of C in 
G. Thus, there is a route between k and k' in G st the nodes in the route are descendant of 
C in G. Note that no node in this route is in Pan(K) or Pan(K') by definition of K and K' . 
Then, K JL G K'\Pa H (K) U Pa H (K'). However, this contradicts the fact that 1(H) C 1(G) 
because, as shown, K _L hK'\Pch(K) U Pan(K'). 

□ 

Lemma 3. Let G and H denote two CGs st 1(H) C 1(G). Let a denote a chain that is 
consistent with H . If no descendant of a node X in G is to the left of X in a, then the 
descendants of X in G are descendant of X in H too. 



x By convention, X_L M 0| uj? =1 Cj \ {X}. 



6 



Proof. Let D denote the descendants of X in G. Let G denote the component of G that 
contains X. Note that the descendants of C in G are exactly the set D. Then, there exists a 
unique component of H that is maximal in H from the set of components of H that contain 
a node from D, by Lemma [2J 

Let K denote the component of H that contains X. Note that K is a component of H 
that is maximal in H from the set of components of H that contain a node from D, since no 
node of D is to the left of X in a. It follows from the paragraph above that K is the only 
such component of H . 

□ 

We are now ready to prove the correctness of Method B3. 

Lemma 4. Let G a denote the MI map of the independence model induced by a CG G relative 
to a chain a. Then, Method B3(G, a) returns G a . 

Proof. We start by proving that Method B3 halts at some point. When Method B3 is done 
with the rightmost block of a, the rightmost block of contains all and only the nodes of the 
rightmost block of a. When Method B3 is done with the second rightmost block of a, the 
rightmost block of (3 contains all and only the nodes of the rightmost block of a, whereas the 
second rightmost block of (3 contains all and only the nodes of the second rightmost block of 
a. Continuing with this reasoning, one can see that when Method B3 is done with all the 
blocks of a, /3 coincides with a and thus Method B3 halts. 

That Method B3 halts at some point implies that it performs a finite sequence of m 
modifications to G due to the fbsplit and fbmerging in lines 6 and 10. Let Gt denote the 
CG resulting from the first t modifications to G, and let Go — G. Specifically, Method B3 
constructs G t +i from G t by either 

• adding an edge X — Y due to line 3 of Fbsplit or Fbmerge, 

• adding an edge X — >■ Y due to line 4 of Fbsplit or Fbmerge, 

• performing all the component splits due to lines 5-8 of Fbsplit, or 

• performing all the component mergings due to lines 5-8 of Fbmerge. 

Note that none of the modifications above introduces new separation statements. This 
is trivial to see for the first and second modification. To see it for the third and fourth 
modification, recall that the splits and the mergings are part of a fbsplit and a fbmerging 
respectively and, thus, they are feasible. Therefore, I(G t +i) C I{G t ) for all < t < m and, 
thus, I(G m ) C /(Go). 

We continue by proving that Gt is consistent with (3 for all < t < m. Since this is true 
for G due to line 1, it suffices to prove that if it is true for G t then it is true for G t+ % for all 
< t < m. We consider the following four cases. 

Case 1: Method B3 constructs Gt+i from Gt by adding an edge X — Y due to line 3 of 
Fbsplit or Fbmerge. It suffices to note that X and Y are in the same block of Gt and 
B. 

Case 2: Method B3 constructs G t +i from G t by adding an edge X — > Y due to line 4 
of Fbsplit. It suffices to note that X is to the left of Y in /3, because G t is consistent 
with (3. 

Case 3: Method B3 constructs G t +i from G t by adding an edge X — > Y due to line 4 
of Fbmerge. Note that X is to the left of R in /3, because /3 is consistent with Gt- 
Then, X is to the left of L in (3, because L is the left neighbor of R in (3 and X ^ L. 
Then, X is to the left of Y in (3, because Y G L. 

Case 4: Method B3 constructs Gt+i from Gt by either performing all the component 
splits due to lines 5-8 of Fbsplit or performing all the component mergings due to 
lines 5-8 of Fbmerge. Note that the splits and the mergings are feasible, since they 
are part of a fbsplit and a fbmerging respectively. Therefore, G t +i is a CG. Moreover, 
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note that (3 is modified immediately after the fbsplit and the fbmerging so that it is 
consistent with G t +i- 

Note that G m is not only consistent with (3 but also with a because, as shown, (3 coincides 
with a when Method B3 halts. In order to prove the lemma, i.e. that G m = G a , all that 
remains to prove is that I(G a ) Q I(G m ). To see it, note that G m = G a follows from 
I (G a ) C I(G m ), I(G m ) C I (Go), the fact that G m is consistent with a, and the fact that G a 
is the unique MI map of /(Go) relative to a. Recall that G a is guaranteed to be unique by 
Lemma [TJ because /(Go) is a graphoid. 

The rest of the proof is devoted to prove that I(G a ) C I(G m ). Specifically, we prove that 
if I(G a ) C I(G t ) then I(G a ) C J(G m ) for all < t < m. Note that this implies that 
I (Ga) Q I{G m ) because I(G a ) C I (Go) by definition of MI map. First, we prove it when 
Method B3 constructs G t +i from G t by either performing all the component splits due to lines 
5-8 of Fbsplit or performing all the component mergings due to lines 5-8 of Fbmerge. Note 
that the splits and the mergings are feasible, since they are part of a fbsplit and a fbmerging 
respectively. Therefore, I(G t +i) = I(G t ). Thus, I(G a ) Q I(G t +i) because I(G a ) Q I(Gt)- 

Now, we prove that if I(G a ) C I(G t ) then I(G a ) Q I(G t +i) when Method B3 constructs 
Gf + i from G t by adding a directed or undirected edge due to lines 3 and 4 of Fbsplit and 
Fbmerge. Specifically, we prove that if there is an S-active route p^ between two nodes A 
and B in G t +i, then there is an S'-active route between A and B in G a . We prove this result 
by induction on the number of occurrences of the added edge in pf+ v We assume without 
loss of generality that the added edge occurs in pf_^ as few or fewer times than in any other 
S'-active route between A and B in Gt+i- We call this the minimality property of p^S- If 
the number of occurrences of the added edge in p^[ is zero, then pf^ is an S-active route 
between A and B in Gt too and, thus, there is an S-active route between A and B in G a since 
I (G a ) Q I(Gt)- Assume as induction hypothesis that the result holds for up to n occurrences 
of the added edge in pf+]_. We now prove it for n + 1 occurrences. We consider the following 
four cases. 

Case 1: Method B3 constructs G t +i from Gt by adding an edge X — Y due to line 3 
of Fbsplit. Note that X — Y occurs in p^Jj Assume that X — Y occurs in a collider 
section of pf+\- Note that X and Y must be in the same component of G t for line 
3 of Fbsplit to add an edge X — Y . This component also contains a node Z that is 
in S because, otherwise, would not be S-active in G 4+ iH Note that there is a 
route X — — Z — — Yin G t - Then, we can replace any occurrence of X — Y 
in a collider section of p^ with X — — Z — — Y, and thus construct an S'- 
active route between A and B in G t +i that violates the minimality property of pfR- 
Since this is a contradiction, X — Y only occurs in non-collider sections of pf+\- Let 
pf+i = pt+i UI-VU p™. Note that X, Y ^ S because, otherwise, pf+ x would not 
be S-active in G t+1 . For the same reason, pf_^ and pj^{ are S-active in G t +i- Then, 
there are S-active routes p^ x and p\ B between A and X and between Y and B in 
G a by the induction hypothesis. 

Let X — X' — ... — Y' — Y be a route in G t st the nodes in X' — ... — Y' are in 
Such a route must exist for line 3 of Fbsplit to add an edge X — Y . Note that X and 
X' are adjacent in G a since I(G a ) C I(G t ). In fact, X — > X' is in G a . To see it, recall 
that Method B3 is currently considering the block G of a, and that it has previously 
considered all the blocks of a to the right of G in a. Then, K only contains nodes 
from G or from blocks to the left of G in a. However, X ^ G because X e K \ L 
and L = K fl G. Then, X is to the left of G in a. Thus, X — Y X' is in G a because 

2 Note that maybe A = X and/or Y = B. 
3 Note that maybe Z = X or Z = Y. 
4 Note that maybe X' = Y'. 



X' G L C C. Likewise, Y — > Y' is in G a . Note also that X' — ... — Y' is in G a . To see 
it, note that the adjacencies in X' — ... — Y' are preserved in G a since I{G a ) C I{G t ). 
Moreover, these adjacencies correspond to undirected edges in G a , because the nodes 
in X' — ... — Y' are in L and thus in the same block of G a , since L C C. Furthermore, 
a node in X' — ... — Y' is in S because, otherwise, p^ UX-I'-...-F'-FU p^ 
would be an S-active route between A and B in that would violate the minimality 
property of pf+i- Then, p^ x Ul->I'-...-l"<-FU p Y a B is an S-active route 
between A and B in G a . 

Case 2: Method B3 constructs Gt+i from G t by adding an edge X — > Y due to line 
4 of Fbsplit. Note that X — > Y occurs in p^H Assume that X — y Y occurs 
as a collider edge in p^u, i.e. X — > Y occurs in a subroute of p^ of the form 
X ->• Y - . . . - Z WU Note that a node in Y - . . . — Z is in S because, otherwise, 
Pt+i would not be 5- active in G t +i- Let X — > X' — ... — Y' — Y be a route in 
Gt st the nodes in X' — ... — Y' are in Such a route must exist for line 4 of 
Fbsplit to add an edge X — > Y . Then, we can replace X — > Y — — Z ■h- W 
with X — > X' — ... — Y' — Y — ... — Z ^— W m. p^, and thus construct an S- 
active route between A and B in Gt+i that violates the minimality property of pf+i- 
Since this is a contradiction, X — ?• K never occurs as a collider edge in p^. Let 
pf+i = pt+i U X -)■ V U p^. Note that X,Y ^ S because, otherwise, p^ would not 
be 5- active in G t +i- For the same reason, p^\ and p^ are S'-active in G t +\- Then, 
there are S'-active routes p^ x and p YB between A and X and between Y and B in 
G Q by the induction hypothesis. 

Let X — > X' — . . . — Y' — Y denote a route in Gt st the nodes in X' — ... — Y' are 
in lJI Such a route must exist for line 4 of Fbsplit to add an edge X — > Y. Note 
that X' — ... — Y' is in G a . To see it, note that the adjacencies in X' — ... — Y' 
are preserved in G a since I(G a ) C I(G t ). Moreover, these adjacencies correspond to 
undirected edges in G a , because the nodes in X' — ... — Y' are in L and thus in the 
same block of G a , since LCC. Furthermore, a node in X' — ... — Y' is in S because, 
otherwise, p^ Ul4l'-...-F'-FU pj_^ would be an S'-active route between 
A and B in G t +i that would violate the minimality property of p^. Moreover, note 
that X and X' are adjacent in G a since I(G a ) C I(G t ). In fact, X — )• X' is in G Q . To 
see it, recall that Method B3 is currently considering the block C of a, and that it has 
previously considered all the blocks of a to the right of C in a. Then, no block to the 
left of K in (3 has a node from C or from a block to the right of C in a. Note that X 
is to the left of K in /3, because /3 is consistent with G t - Thus, X — >■ X' is in Gq, since 
X' G L C C. Likewise, note that Y 7 and K are adjacent in G a since I(G a ) C I(G t )- 
In fact, F' -<— y is in G Q . To see it, note that X only contains nodes from C or from 
blocks to the left of C in a. However, Y ^ C because Y G K \ L and L = K n C. 
Then, Y is to the left of C in a. Thus, Y' <— Y is in G a because Y' G L C C. Then, 
p^ x UX-^X'-.-.-Y'^-YU p^ B is an S- active route between A and I? in G a . 

Case 3: Method B3 constructs G t +i from G t by adding an edge X — Y due to line 3 of 
Fbmerge. Note that X — Y occurs in p^. We consider two cases. 

Case 3.1: Assume that X — Y occurs in a collider section of pfBy- Let pf B x = 
Pt^-y U Z — )■ X' — ... — X — Y — ... — Y' <— W U p%_f E Note that Z, W £ S 
because, otherwise, p^ would not be S-active in Gt+i- For the same reason, 



Note that maybe A = X and/or Y = B. 
Note that maybe Y = Z and/or = X. 
Note that maybe X 1 = Y' . 
Note that maybe X' = Y' . 

Note that maybe A = Z,X' = X,Y'=Y,W = Z and/or W = B. 
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and p^f are S-active in G t +i- Then, there are S-active routes p„ and pjf 
between A and Z and between W and B in G a by the induction hypothesis. 
Let Ri denote the component of G t in R that Fbmerge is processing when the 
edge X — Y gets added. Recall that Method B3 is currently considering the block 
C of a, and that it has previously considered all the blocks of a to the right of C 
in a. Then, Ri only contains nodes from C or from blocks to the left of C in a. In 
other words, Ri C UjliCj \ {X, Y} where C fc * = C (recall that X, Y G L C C). 
Therefore, X / G( Y| Ujl x Cj \ {X, Y} because X and Y must be in Pa Gt {Ri) 
for line 3 of Fbmerge to add an edge X — Y. Then, X and Y are adjacent in 
G a because, otherwise, X _L Ga Y\ ^Jf=i \ {X,Y} which would contradict that 
I(G a ) C I(G t ). In fact, X — Y is in G a because X and y are in the same block 
of a, since X, Y G L C C 

Note that X' — ... — X and Y — . . . — Y' are in G a . To see it, note that the 
adjacencies in X' — ... — X and Y — . . . — Y' are preserved in G a since I(G a ) C 
I{G t ). Moreover, these adjacencies correspond to undirected edges in G a , because 
the nodes in X' — ... — X and Y — . . . — Y' are in L since 1,7 e I and, thus, 
they are in the same block of G a since L C C. Then, X' — ... — X — Y — ... — Y' 
is in G a . Furthermore, a node in X' — ... — X — Y — ... — Y' is in 5 because, 
otherwise, p^ would not be S'-active in G t +i- Note also that Z and X' are 
adjacent in G a since I(G a ) C I{G t ). In fact, Z — >■ X' is in G a . To see it, 
recall that Method B3 is currently considering the block C of a, and that it 
has previously considered all the blocks of a to the right of C in a. Then, no 
block to the left of L in has a node from C or from a block to the right of 
C in a. Note that Z is to the left of L in /3, because /3 is consistent with G t . 
Thus, Z — >■ X' is in G a since X' e L C C. Likewise, F' W is in G Q . Then, 
p£ z UZ -»■ X' - . . .-X - Y - . . ^- WU p^ B is an S'-active route between 

A and -B in G a . 

Case 3.2: Assume that X — Y occurs in a non-collider section of p^. Note that 
this implies that G t has a descending route from X to A or to a node in S, or 
from Y to -B or to a node in S. Assume without loss of generality that Gt has a 
descending route from Y to S or to a node in S. 

Let Ri denote the component of Gt in R that Fbmerge is processing when the 
edge X — Y gets added. Let Ly denote the component of G t that contains the 
node Y. Let D denote the component of G a that is maximal in G a from the set 
of components of G a that contain a descendant of Ly in G t . Recall that D is 
guaranteed to be unique by Lemma [21 because I(G a ) C I{G t ). We now show 
that some d G D is a descendant of i?j in G^. We consider four cases. 

Case 3.2.1: Assume that D n 7^ 0. It suffices to consider any d G 
To see it, recall that Method B3 is currently considering the block C of 
a, and that it has previously considered all the blocks of a to the right of 
C in a. Then, Ri only contains nodes from C or from blocks to the left 
of C in a. Thus, d is not to the right of the nodes of D PI Ly in a, since 
Ly CLCC. Moreover, d is not to the left of the nodes of D D in a 
because, otherwise, there would be a contradiction with the definition of 
D. Then, deD. 

Case 3.2.2: Assume that D flLy = and DnRi ^ 0. It suffices to consider 
any d G -D fl -Rj. 

Case 3.2.3: Assume that £> fl Ly = 0, D fl Ri = 0, and some deD was 
a descendant of some r G -R, in Go- Recall that Method B3 is currently 
considering the block C of a, and that it has previously considered all the 
blocks of a to the right of C in a. Then, Ri only contains nodes from 
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G or from blocks to the left of C in a. Then, r was not in the blocks of 
a previously considered, since r G Ri. Therefore, no descendant of r in 
G is currently to the left of r in (3 and, thus, the descendants of r in G 
are descendant of r in G t by Lemma [3, because I{G t ) C J(G ) and f3 is 
consistent with G t . Then, d is a descendant of r and thus of Ri in G t . 
Case 3.2.4: Assume that D fl Ly = 0, D fl i?j = 0, and no node of D was a 
descendant of a node of i?j in Go. As shown in Case 3.2.3, the descendants 
of any node r G Ri in Go are descendant of r in Gt too. Therefore, no 
descendant of r in Go was to the left of the nodes of D in a because, 
otherwise, a descendant of r and thus of Ly in Gt would be to the left of the 
nodes of D in a, which would contradict the definition of D. Recall that no 
descendant of r in Go was in D either. Note also that the nodes of D are to 
the left of the nodes of Ri in a, by definition of D and the fact that DdRi = 
0. These observations have two consequences. First, the components of 
G containing a node from D were still in H when any component of G 
containing a node from Ri became a terminal component of H in Construct 
(3. Thus, Construct (3 added the components of G containing a node from 
D to after having added the components of G containing a node from 
Ri. Second, Construct (3 did not interchange in (3 any component of G 
containing a node from D with any component of G containing a node 
from Ri. 

Recall that Method B3 is currently considering the block G of a, and that 
it has previously considered all the blocks of a to the right of G in a. 
Note that the nodes of D were not in the blocks of a previously considered 
because, otherwise, G and thus the nodes of Ly (recall that Ly CiCC) 
would be to the left of D in a, which would contradict the definition of D. 
Therefore, the nodes of D are currently still to the left of Ri in (3. Note 
that the only component to the left of Ri in (3 that contains a descendant of 
L Y in G t is precisely L Y , because L is the left neighbor of R in (3, L Y C L, 
and (3 is consistent with G t . However, D fl Ly = 0. Thus, D contains no 
descendant of Ly in G t , which contradicts the definition of D. Thus, this 
case never occurs. 

We continue with the proof of Case 3.2. Let pf^ = pf£ U X - Y U ^0 Note 
that X, Y S because, otherwise, pf^ would not be S'-active in G t +\. For the 
same reason, p^ x and pJJ{ are S'-active in G t +i- Note that X and Y must be 
in PciG t {R-i) f° r nne 3 °f Fbmerge to add an edge X — Y. Then, no descendant 
of Ri in G t is in S because, otherwise, there would be an S'-active route pf Y 
between X and Y in G t and, thus, pf^ U pf Y U pJJ^ would be an S-active 
route between A and B in G t +i that would violate the minimality property of 
pffi- Then, there is an S-active descending route p r t d from some r G Ri to some 
d G D in Gt because, as shown, D contains a descendant of Ri in Gt- Then, 
pf^ U X — > X' — rU p\ d is an S-active route between A and d in Gf+io 
Likewise, p^[ UY— )■ Y' — — rU p r t d is an S-active route between B and d 
in Gt+i, where pf Y x denotes the route resulting from reversing p^EB Therefore, 
there are S-active routes p^ d and d between A and d and between B and d in 
G a by the induction hypothesis. 

Recall that we assumed without loss of generality that Gt has a descending route 
from Y to a node E st E = B or E G S. Note that -E is a descendant of Ly in 



10 Note that maybe A = X and/or F = B. 
11 Note that maybe X' = r. 
12 Note that maybe Y' = r. 
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Gt and, thus, E is a descendant of d in G a by definition of D and the fact that 
d G D. Let p^ B denote the descending route from d to E in G a . Assume without 
loss of generality that G a has no descending route from d to B or to a node of S 
that is shorter than p^ E ■ We now consider two cases. 

Case 3.2.5: Assume that E = B. Note that pff is S-active in G a by defini- 
tion and the fact that d ^ S. To see the latter, recall that no descendant of 
Ri in Gt (among which is d) is in S. Thus, p^ d U p dE is an S'-active route 
between A and B in G a . 
Case 3.2.6: Assume that E G S. Let p^, B and pf d denote the routes resulting 
from reversing p^ d and pff. Consider the route p^ d U p^, B between A and 
5 in G a . If this route is S'-active, then we are done. If it is not S-active 
in G a , then d occurs in a collider section of p^ d U p^, B that has no node in 
S. Then, we can replace each such occurence of d with pff U pf d and, thus 
construct an S-active route between A and B in G Q . 
Case 4: Method B3 constructs G t +i from by adding an edge X — > Y due to line 4 
of Fbmerge. Note that X — > Y occurs in p££. We consider two cases. 

Case 4.1: Assume that X — > Y occurs as a collider edge in pf^ v Let pff ± = 
pf_^ U X ->■ Y U jO^El Note that X ^ S because, otherwise, p^ would not be 
S-active in G t +i- For the same reason, pf+ x is S-active in G t +\. Then, there is 
an S-active route p^ x between A and X in G a by the induction hypothesis. 
Let Ri denote the component of Gt in R that Fbmerge is processing when the edge 
X — > Y gets added. Recall that Method B3 is currently considering the block C 
of a, and that it has previously considered all the blocks of a to the right of C in 
a. Then, Ri only contains nodes from C or from blocks to the left of C in a. In 
other words, Ri C Uf =1 Cj \ {X, Y} where k* is the smallest k st X, Y G U^ =1 Cj 
(recall that FeLC C). Therefore, XJL (h Y\ Uf =1 C 5 \ {X, Y} because X and Y 
must be in Pac t {Ri) for line 4 of Fbmerge to add an edge X — > Y. Then, X and 
Y are adjacent in G a because, otherwise, X1gT| Cj \ {X, Y} which would 
contradict that I(G a ) C I(Gt)- In fact, X — > Y is in G Q . To see it, recall that 
Method B3 is currently considering the block C of a, and that it has previously 
considered all the blocks of a to the right of C in a. Then, no block to the left 
of L in /3 has a node from C or from a block to the right of C in a. Note that X 
is to the left of R in (3, because /3 is consistent with G t - Then, X is to the left of 
L in 0, because L is the left neighbor of R in /3 and X ^ L. Thus, X — > Y is in 
G a because Y G L C C. We now consider two cases. 

Case 4.1.1: Assume that pj_^ = Y - ...-Y <- XU pfjf. Note that a node 
in Y — ... — Y is in S because, otherwise, p^ would not be S-active in 
G t +i- For the same reason, pf+ x is S-active in G t +i- Then, there is an 
S-active route p* B between X and B in G a by the induction hypothesis. 
Note that Y — ... — Y is in G a . To see it, note that the adjacencies in 

Y — . . . — Y are preserved in G a since I(G a ) C I(G t ). Moreover, these 
adjacencies correspond to undirected edges in G a , because the nodes in 

Y — . . . — Y are in L since Fel and, thus, they are in the same block of 
G a since L C C. Then, p^ x Ul->F-...-F<-lU is an S-active 
route between A and B in G a . 

Case 4.1.2: Assume that p%{=Y-...-Z<-W\J pjji Note that 
^ S and a node in K — ... — Z is in S because, otherwise, p^ would 



Note that maybe A = X and/or Y = B. 
Note that maybe Y = Z, W = X and/or W 
Z and W — X is covered by Case 4.1.1. 



= B. Note that Y ^ Z or W ^ X, because the case where 
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Method G2H(G, H) 

/* Given two CGs G and H st 1(H) C 1(G), the algorithm transforms G into H 
by a sequence of directed and undirected edge additions and feasible splits and 
mergings st after each operation in the sequence G is a CG and 1(H) C 1(G) */ 

1 Let a denote a chain that is consistent with H 

2 Method B3(G, a) 

3 Add to G the edges that are in H but not in G 



Figure 3. Method G2H. 

not be S-active in G t+ \. For the same reason, pj^f is S-active in G t+ %. 
Then, there is an S'-active route p^ B between W and B in G a by the 
induction hypothesis. Note that Y — ... — Z is in G a . To see it, note that 
the adjacencies in Y — ... — Z are preserved in G a since I(G a ) C I(G t ). 
Moreover, these adjacencies correspond to undirected edges in G a , because 
the nodes in Y — ... — Z are in L since Y £ L and, thus, they are in the 
same block of G a since L C C. Moreover, note that Z and IF are adjacent 
in Cr Q since I(G a ) C I(G t ). In fact, Z IF is in G a . To see it, recall 
that no block to the left of L in (3 has a node from C or from a block to 
the right of C in a. Note that IF is to the left of L in (3, because (3 is 
consistent with Gt- Thus, Z IF is in G a since Z £ L C C. Then, 
p^ x UX-^y-...-2^iyu pjf B is an S'-active route between A and 
B'mG a . 

Case 4.2: Assume that X — > Y occurs as a non-collider edge in p^S- The proof 
of this case is the same as that of Case 3.2, with the only exception that X — Y 
should be replaced by X — > Y. 

□ 

We are now ready to prove the main result of this paper, namely that the extension of 
Meek's conjecture to CGs is true. The proof is constructive in the sense that we give an 
algorithm that constructs a valid sequence of operations. The pseudocode of our algorithm, 
called Method G2H, can be seen in Figure [3J The following theorem proves that Method 
G2H is correct. 

Theorem 1. Given two CGs G and H st 1(H) C 1(G), Method G2H(G, H) transforms G 
into H by a sequence of directed and undirected edge additions and feasible splits and mergings 
st after each operation in the sequence G is a CG and 1(H) C 1(G). 

Proof. Note from line 1 that a denotes a chain that is consistent with H. Let G a denote 
the MI map of 1(G) relative to a. Recall that G a is guaranteed to be unique by Lemma [TJ 
because 1(G) is a graphoid. Note that 1(H) C 1(G) implies that G a is a subgraph of H . To 
see it, note that 1(H) C 1(G) implies that we can obtain a MI map of 1(G) relative to a by 
just removing edges from H. However, G a is the only MI map of 1(G) relative to a. 

Then, it follows from the proof of Lemma H] that line 2 transforms G into G a by a sequence 
of directed and undirected edge additions and feasible splits and mergings, and that after 
each operation in the sequence G is a CG and I(G a ) C 1(G). Thus, after each operation in 
the sequence 1(H) C 1(G) because 1(H) C I(G a ) since, as shown, G a is a subgraph of H. 
Finally, line 3 transforms G from G a to if by a sequence of edge additions. Of course, after 
each edge addition G is a CG and 1(H) C 1(G) because G a is a subgraph of H. 
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